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Abstract 

We  combine  a  parallel  algorithm  for  the  two  processor  scheduling  problem, 
which  runs  in  polylog  time  on  a  polynomial  number  of  processors,  with  an 
algorithm  to  find  transitive  orientations  of  graphs  where  they  exist.  Both 
algorithms  together  solve  the  maximum  clique  problem  and  the  minimum 
coloring  problem  for  comparability  graphs,  and  the  maximum  matching 
problem  for  co-comparability  graphs.  These  parallel  algorithms  can  also  be 
used  to  identify  permutation  graphs  and  interval  graphs,  important  subclasses 
of  perfect  graphs. 
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1  introduction 


We  present  several  parallel  algorithms  for  graph  problems,  in  particular  for 
perfect  graphs.  Our  main  result  is  a  deterministic  AfC  algorithm  for  solving  the 
two  processor  unit  execution  time  scheduling  problem,  answering  an  important 
open  problem  posed  in  [24].  We  also  present  an  AfC  algorithm  for  transitively 
orienting  comparability  graphs.  By  combining  these  two  results,  we  obtain  an  AfC 
algorithm  for  the  matching  problem  on  co-comparability  graphs  (the  complements 
of  comparability  graphs)  and  nearly  co-comparability  graphs.  In  addition  our 
transitive  orientation  algorithm  gives  us  AfC  algorithms  for  several  additional 
problems,  such  as  identifying  permutation  graphs  and  finding  the  maximum 
weighted  clique  and  optimal  colorings  in  comparability  graphs.  Comparability, 
co-comparability,  and  permutation  graphs  are  all  important  subclasses  of  perfect 
graphs. 

The  most  fundamental  scheduling  problems  involve  unit  time  execution  tasks 
with  precedence  constraints  restricting  the  order  of  execution.  When  the  number 
of  processors  varies,  the  scheduling  problem  is  jV'P-complete  [23].  There  are  no 
published  polynomial  time  algorithms  for  a  fixed  number  of  processors  greater  than 
two.  The  first  polynomial  time  algorithm  for  the  two  processor  case  was  published 
in  1969  [5].  Paster  algorithms  for  the  same  problem  were  given  by  Coffman  and 
Graham  [2],  and  later,  Gabow  [6,7]  found  an  asymptotically  optimal  algorithm. 
Recently,  Vazirani  and  Vazirani  have  published  a  randomized  parallel  solution 
[24].  Like  Fujii  et  al.  they  use  the  connection  between  matching  and  two  processor 
scheduling,  so  their  algorithm  relies  on  an  TtAfC  matching  subroutine  such  as  [14] 
or  [18]. 

In  contrast,  our  scheduling  algorithm  [12]  is  deterministic  and  does  not  require 
the  aid  of  a  matching  subroutine.  Therefore  we  are  able  to  exploit  the  relationship 
between  matching  and  two  processor  scheduling  in  the  other  direction,  obtaining  a 
deterministic  parallel  maximum  matching  algorithm  for  co-comparability  graphs. 

The  only  ingredient  required  to  convert  our  scheduling  algorithm  into  a 
matching  result  is  an  AfC  transitive  orientation  subroutine.  This  routine  takes 
an  undirected  graph  and  directs  the  edges  so  that  the  resulting  digraph  is 
transitively  closed.  Those  graphs  which  can  be  transitively  oriented  are  called 
comparability  graphs.  The  complements  of  comparability  graphs  are  co-compa¬ 
rability  graphs.  Kozen,  Vazirani  and  Vazirani,  in  independent  work,  coupled  a 
different  transitive  orientation  routine  with  our  two  processor  scheduling  algorithm 
to  achieve  an  AfC  matching  algorithm  on  co- comparability  graphs  [16].  Our 
transitive  orientation  subroutine  is  also  the  key  element  in  our  algorithms  for 
testing  for  permutation  graphs  and  finding  maximum  weighted  cliques  and  optimal 
colorings  on  comparability  graphs. 


2  Main  Theorems  and  Applications 

In  this  section  we  give  some  definitions,  state  our  main  results,  and  prove  several 
important  consequences. 

As  our  model  of  parallel  computation,  we  use  the  Parallel  Random  Access 
Machine  or  PRAM  as  defined  in  [4].  A  PRAM  consists  of  an  unbounded  number 
of  identical  processors  running  synchronously.  Each  such  processor  can  be  thought 
of,  for  the  purpose  of  this  paper,  as  an  ordinary  RAM  [1],  with  local  memory.  A 
PRAM  also  contains  an  unbounded  number  of  global  memory  cells  which  every 
processor  can  access  in  one  timestep.  We  allow  that  several  processors  read  the 
same  memory  cell  simultaneously.  However,  several  processors  must  not  write 
simultaneously  to  the  same  memory  cell.  Every  processor  has  stored,  in  one 
of  its  registers,  its  unique  processor  index.  All  processors  execute  the  same 
program.  Since  the  instructions  may  depend  on  the  processor  index,  the  effect 
of  an  instruction  will  in  general  vary  from  processor  to  processor. 

When  measuring  the  complexity  of  parallel  algorithms  (for  the  PRAM  model), 
we  are  mainly  interested  in  the  amount  of  time  an  algorithm  uses,  and  the  number 
of  processors  it  employs.  Time  will  be  the  number  of  parallel  steps  taken  by  the 
PRAM,  and  the  number  of  processors  will  be  the  highest  index  of  a  processor 
active  in  the  computation. 

The  class  of  parallel  algorithms  running  in  time  which  is  bounded  by  a 
polynomial  in  the  logarithm  of  the  size  of  the  input,  and  using  a  number  of 
processors  polynomial  in  the  input  size,  has  experienced  considerable  interest.  One 
reason  is  that  the  algorithms  in  this  class  are  considered  very  fast  (the  “speedup” 
over  their  sequential  counterparts  is  exponential),  and  they  use  a  “reasonable” 
amount  of  hardware,  i.e.  processors.  Another  reason  is  that  this  class  is  very 
robust  under  (reasonable)  variations  in  the  definitions  of  the  underlying  machine 
model.  The  class  is  commonly  referred  to  as  NC,  owing  to  its  original  definition 
for  the  boolean  circuit  model  of  parallel  computation  in  [20]. 

A  perfect  graph  is  an  undirected  graph  where  the  chromatic  number  and 
maximum  clique  size  of  every  induced  subgraph  coincide.  A  precedence  graph 
is  an  acyclic,  transitively  closed  digraph.  We  use  (a,  6}  to  denote  an  undirected 
edge,  and  (a,  6)  to  denote  a  directed  edge  or  ore  from  vertex  a  to  vertex  b.  Thus 
if  arcs  (a,  b)  and  (b,  c)  are  in  a  precedence  graph,  then  so  is  the  arc  (a,c).  A 
comparability  graph  is  an  undirected  graph  with  the  property  that  every  edge  can 
be  assigned  a  direction  such  that  the  resulting  graph  is  a  precedence  graph.  The 
complement  of  a  comparability  graph  is  a  co-comparability  graph.  Precedence 
graphs  are  equivalent  to  partial  orders.  Some  graphs,  such  as  a  simple  three-cycle, 
are  both  comparability  and  co-comparability  graphs. 

The  undirected  graph  G  —  (V,  E)  is  a  permutation  graph  if  there  exists  a 
pair  of  permutations  on  the  vertices  such  the  edge  6  E  if  and  only  if 


v  precedes  v'  (or  v'  precedes  v)  in  both  permutations.  Permutation  graphs  are 
equivalent  to  the  comparability  graphs  of  partial  orders  with  dimension  two.  A 
graph  is  both  a  comparability  graph  and  a  co-comparability  graph  if  and  only  if 
it  is  a  permutation  graph  [21].  Permutation  graphs,  comparability  graphs  and 
co-comparability  graphs  are  all  subclasses  of  perfect  graphs  [9]. 

An  instance  of  the  two  processor  scheduling  problem  is  given  by  a  precedence 
graph  G  =  (V,E).  Each  vertex  represents  a  task  whose  execution  requires  unit 
time  on  either  of  two  identical  processors.  If  there  is  a  directed  edge  from  task  t  to 
task  t',  then  task  t  must  be  completed  before  task  t'  can  be  started.  A  schedule  is 
a  mapping  from  tasks  to  integer  timesteps  such  that  at  most  two  tasks  are  mapped 
to  each  timestep  and  for  all  tasks  t  and  t'  if  t  must  precede  t'  (t  -<  t')  then  t  is 
mapped  to  an  earlier  timestep  than  t' .  The  length  of  a  schedule  is  the  number  of 
timesteps  used.  An  optimal  schedule  is  one  of  shortest  length. 

The  maximum  matching  problem  on  co-comparability  graphs  and  the  two 
processor  scheduling  problem  are  closely  related.  If  G  is  a  co-comparability  graph 
and  G  is  a  transitive  orientation  of  G’s  complement,  then  the  pairs  of  tasks  mapped 
to  the  same  timestep  in  an  optimal  two  processor  schedule  of  G  correspond  to 
a  maximum  matching  in  G.  Furthermore,  there  is  a  jquential  algorithm  for 
converting  any  maximum  matching  for  G  into  an  optimal  two  processor  schedule 
for  3  [5].  In  [24]  it  was  conjectured  that  this  process  is  inherently  sequential,  but 
with  our  two  processor  scheduling  algorithm  it  can  be  solved  quickly  in  parallel. 

Theorem  1:  Two  processor  scheduling  is  in  AfC. 

Proof:  We  outline  an  O(log2  n)  time  algorithm  in  section  3.  Further  details  can 
be  found  in  [12].  □ 

Theorem  2:  There  is  an  A fC  algorithm  which  detects  if  an  undirected  graph  is 
transitively  orientable,  and  if  so  finds  a  transitive  orientation. 

Proof:  We  present  such  an  algorithm  in  section  4.  See  also  [16].  0 

Corollary  2.1:  There  is  an  AfC  algorithm  which  detects  whether  or  not  a  graph 
is  a  permutation  graph. 

Proof:  Graph  G  is  a  permutation  graph  if  and  only  if  both  G  and  G 

are  comparability  graphs  [21].  Therefore,  by  running  our  transitive  orientation 
algorithm  on  both  G  and  (5,  we  can  determine  if  G  is  a  permutation  graph.  0 

Corollary  2.2:  There  is  an  AfC  algorithm  which  finds  a  maximum  node-weighted 
clique  in  comparability  graphs. 


Proofs  Given  &  comparability  graph  G,  we  find  a  transitive  orientation,  G. 
Examine  any  jb-path  in  (5.  A  Jb-path  is  a  directed  path  containing  exactly  k 
vertices.  Because  G  is  transitively  closed,  the  nodes  on  the  fc-path  form  a  fc- clique 
in  G.  Similarly,  every  k  clique  in  G  is  a  Jb-path  in  G.  Thus  the  problem  of  finding 
a  mitYirrmm  node- weighted  clique  in  G  reduces  to  finding  a  maximum  weight  path 
in  <?.  Since  Cr  is  a  DAG,  standard  parallel  techniques  (*.e.,  max-plus  closure)  can 
be  used  to  find  a  heaviest  path  in  G.  Q 

Corollary  2.3:  There  is  an  MC  algorithm  which  finds  a  minimal  node-coloring 
of  comparability  graphs. 

Proof:  Given  a  comparability  graph  G,  we  find  a  transitive  orientation,  G.  We 
say  that  a  vertex  v  is  on  level  t  in  G  if  the  longest  (directed)  path  from  v  to  a 
sink  contains  exactly  i  vertices.  Clearly  any  pair  of  nodes  on  the  same  level  are 
not  adjacent  in  G,  so  they  can  be  assigned  the  same  color.  Every  node  on  level 
*  >  1  is  a  predecessor  of  at  least  one  node  on  level :  —  1.  Therefore,  if  G  has  k 
levels  then  G  has  a  path  of  length  k  and  G  has  a  fc-clique.  Since  no  coloring  can 
use  fewer  colors  than  the  size  of  the  largest  clique,  using  a  distinct  color  for  every 
level  yields  an  optimal  coloring.  Q 

Theorem  3:  Finding  maximum  matchings  on  co-comparability  graphs  is  in  A fC. 

Proof:  One  such  algorithm  is  given  in  section  5.  0 

This  theorem  is  extended  to  nearly  co-comparability  graphs  in  section  5. 

Corollary  3.1:  Maximum  matchings  on  permutation  graphs  and  partial  orders 
of  dimension  2  can  be  constructed  in  AfC. 

Proof:  As  stated  above,  these  graphs  are  co-comparability  graphs.  □ 

Corollary  3.2:  Maximum  matchings  on  interval  graphs  can  be  fotmd  in  MC. 
Proof:  Interval  graphs  are  a  subclass  of  co-comparability  graphs  [8].  0 

3  Two  Processor  Scheduling 

In  this  section,  we  consider  the  scheduling  problem  for  task  systems  with  arbitrary 
precedence  constraints,  unit  execution  time  per  task,  and  two  identical  processors. 
Our  scheduling  algorithm  for  this  problem  is  built  around  a  routine  that,  for  any 
precedence  graph,  computes  the  length  of  the  graph’s  optimal  schedule(s).  This 
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Figure  1:  This  is  a  precedence  graph  containing  fifteen  tasks  (transitive  arcs  have 
been  omitted).  The  special  tasks  tu,v  and  Uot  are  added  when  computing  the  length 
of  optimal  schedules  for  G.  The  levels  of  the  original  graph  are  on  the  left  and  the 
jump  sequence  is  on  the  right. 


length  routine  is  applied  repeatedly  in  order  to  actually  find  an  optimal  schedule 
for  the  input  graph. 

Let  G  =  (V,  -<)  be  the  precedence  graph  we  are  interested  in.  If  t  -<  t'  then  t  is 
a  predecessor  of  t'  and  t'  is  a  successor  of  t.  For  any  pair  of  tasks,  t,  t'  6  V,  define 
Vf,  to  be  the  set  of  tasks  which  are  both  successors  of  t  and  predecessors  of  t',  and 
let  G\,  be  the  subgraph  of  G  induced  by  Ft5.  The  schedule  distance  between  tasks 
t  and  t',  SD(t,t'),  is  defined  to  be  the  length  of  any  optimal  schedule  for  G\>.  If 
t  t'  then  SD(t,  t')  =  0. 

Lemma  3.1:  Let  t,t'  €  V,  and  let  S  be  a  set  of  tasks  such  that  for  all  i  €  5: 
i.  t  -<  f  -<  f; 


<*o(*,*)  :=  0; 

for  t  :=  1  to  flogn]  do 

for  all  t,  t1  with  t  -<  i'  do  in  parallel 
for  all  0  <  k,  l  <  n  —  1  do  in  parallel 

:=  {s  :  t  ■<  s  ■<  t', 

di-\(t,s)  >  k , 
di-x(s,t')  >  /}; 

*(*’*') := 

k  + 1  +  riV.wl/2]}; 

SD(+,  ★)  :=  dfio, „•)(★,  *) 

Figure  2:  The  Distance  Algorithm. 

iii.  SD >  /. 

Then  SD(t,t')  >  Ik  + 1  +  f|5|/2l. 

Proof:  Count  the  number  of  timesteps  required  to  schedule  those  tasks 

between  t  and  t‘.  There  must  be  at  least  k  timesteps  before  the  first  task  in  S 
is  scheduled.  It  takes  at  least  |S|/2  timesteps  to  complete  the  tasks  in  5.  After 
the  last  task  in  5  has  been  completed,  at  least  l  additional  timesteps  are  required. 
Therefore  SD(t,  ?)>k  +  l  +  |S|/2.  0 

The  distance  algorithm  (see  Figure  2)  uses  a  doubling  method  like  transitive 
closure  to  compute  the  schedule  distances  between  all  pairs  of  tasks  in  a  precedence 
graph  G  =  (V,  -<).  It  initially  guesses  that  the  scheduling  distance  between  each 
pair  of  tasks  is  at  least  zero.  By  repeatedly  applying  Lemma  3.1  to  each  pair 
of  tasks  in  parallel  the  algorithm  refines  its  guesses.  Below  we  prove  that  after 
log  |V|  iterations,  the  algorithm’s  guess  for  each  pair  of  tasks  has  converged  to  the 
schedule  distance.  The  distance  algorithm  has  a  straightforward  implementation 
on  an  n5  processor  PRAM  taking  O(log3  n)  time. 

Lemma  3.2:  The  distance  algorithm  always  computes  the  schedule  distance 
between  every  pair  of  tasks. 

Proof:  Lemma  3.1  guarantees  that  the  distances  computed  by  the  algorithm  are 
never  greater  than  the  the  schedule  distances. 

In  [2]  it  is  shown  how  to  construct  sets  of  tasks  Xo,  Xi»  •  •  •  >  X*  for  any  precedence 
graph  such  that: 

•  Those  tasks  in  any  x<  are  predecessors  of  all  tasks  in  x;-i>  and 

•  The  length  of  optimal  schedules  for  G  is  Ci  fix«l/2]  (See  Figure  3). 
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Figure  3:  This  is  an  LMJ  schedule  for  the  graph  in  Figure  1;  each  x.  hoxed. 

Our  algorithm  does  not  compute  the  x.’s  directly,  we  simply  use  their  existence  to 
prove  that  the  distances  the  algorithm  does  compute  converge  to  the  scheduling 
distance. 

Examine  how  the  distance  algorithm  determines  the  schedule  distance  between 
an  arbitrary  pair  of  tasks,  t  and  Let  Xi.Xa*— »Xfc  be  a  set  of  x.’s  for  <?{», 
Xm-i  =  {<},  and  xo  =  {t*}.  After  the  first  iteration  of  the  outer  loop,  the  distance 
computed  between  any  task  in  x«  and  one  in  x.-2  is  at  least  f|x«— 1 1/2] .  After 
the  second  iteration,  the  distance  computed  between  any  task  in  x«  and  any  task 
in  x«-4  is  at  least  fxi-i/2]  +  fx»— 2/2*)  +  [xi-3/2].  This  is  an  easy  consequence 
of  Lemma  3.1  with  S  =  Xi-2,  k  =  [x._i/2"],and  /  =  [xi-3/2].  In  each  iteration 
we  double  the  number  of  x.’s  accounted  for.  After  log  h  iterations,  the  computed 
distance  between  <  and  t'  is  at  least  the  optimal  schedule  length  for  G\,y  and  thus 
at  least  SD(t,t'). 

Since  G  contains  n  tasks,  each  G\,  has  at  most  n  —  2  x.’s-  Therefore,  after 
flog  n]  iterations  the  algorithm  has  converged  to  the  schedule  distances  for  each 
pair  of  tasks.  Q 

The  distance  algorithm  can  be  used  to  compute  the  length  of  optimal  schedules 
for  a  graph.  Augment  the  graph  with  two  dummy  tasks,  ttop  and  fbot>  which  are  a 
predecessor  and  successor  (respectively)  of  all  other  tasks  in  G.  Now  SD(ttop,  <bot) 
is  the  length  of  G's  optimal  schedules,  and  can  be  found  using  the  distance 
algorithm. 

The  method  for  converting  the  distance  algorithm  into  one  which  finds  an 
optimal  schedule  involves  several  constructions.  For  the  sake  of  brevity  this  paper 
contains  only  an  outline  of  our  method.  Interested  readers  may  consult  [12]  for  a 
more  detailed  presentation. 

The  search  for  an  optimal  schedule  can  be  restricted  to  the  class  of 
Lexicographically  Maximal  Jump  (LMJ)  schedules.  Each  task  t  in  the  precedence 
graph  is  assigned  a  level  equal  to  the  number  of  tasks  in  the  longest  path  from  t  to 
a  sink.  A  level  schedule  gives  preference  to  tasks  on  higher  levels.  More  precisely, 


suppose  levels  L, ...,/  + 1  have  already  been  scheduled  and  there  are  k  unscheduled 
tasks  remaining  on  level  l.  If  k  is  even  a  level  schedules  puts  the  k  tasks  in  pairs, 
and  there  is  no  jump  from  level  /.  If  k  is  odd,  a  level  schedule  pairs  k  —  1  of  the 
tasks  with  each  other  and  the  remaining  task  is  paired  with  a  task  from  a  lower 
level  V  <  l.  In  this  case,  level  l  jumps  to  level  V .  We  assume  that  there  are  an 
unlimited  number  of  dummy  tasks  on  level  0  which  can  be  paired  with  any  other 
tasks.  The  jump  sequence  of  a  level  schedule  is  the  sequence  of  levels  jumped  to, 
listed  in  the  order  in  which  the  jumps  occur  (see  Figure  1).  The  Lexicographically 
Maximum  Jump  (LMJ)  sequence  is  the  jump  sequence  (resulting  from  some  level 
schedule)  that  is  lexicographically  greater  than  any  other  jump  sequence  resulting 
from  a  level  schedule.  An  LMJ  schedule  is  a  level  schedule  whose  jump  sequence  is 
the  LMJ  sequence.  Note  that  our  definition  of  LMJ  is  similar  to  the  definitions  of 
highest  level  first  in  [6]  and  [24].  The  following  theorem  establishes  the  importance 
of  LMJ  schedules. 

Theorem  4:  [6]  Every  LMJ  schedule  is  optimal.  0 

Our  two  processor  algorithm  uses  the  distance  algorithm  to  find  the  LMJ 
sequence  and  which  jump  (if  any)  a  pair  of  tasks  can  be  used  for.  In  general, 
there  will  be  many  possible  pairs  for  each  jump.  A  path  doubling  computation 
finds  a  consistent  set  of  task  pairs  for  the  jumps.  The  remaining  tasks  are  paired  up 
within  levels.  Since  there  are  never  precedence  constraints  between  two  tasks  on 
the  same  level,  this  pairing  can  be  done  arbitrarily.  An  LMJ  schedule  is  obtained 
by  sorting  the  resulting  set  of  task  pairs  (both  for  jumps  and  within  levels).  We 
refer  the  reader  to  [12]  for  a  complete  description  of  the  technically  more  involved 
parts  of  this  construction. 

4  Transitive  Orientation 

The  transitive  orientation  problem  is  nontrivial  because  some  edges  cannot  be 
oriented  independently.  If  the  edges  {a,  6}  and  {b,c}  are  in  the  graph  to  be 
oriented,  but  the  edge  {a,  c}  is  not,  then  the  edges  {a,  b},  {6,  c}  cannot  be  oriented 
independently.  If  we  choose  the  arc  (a,  b)  then  we  are  forced  to  include  the  arc 
(6,c)  in  the  transitive  orientation  (see  Figure  4).  The  binary  relation  r  reflects 
this  simple  kind  of  forcing.  [21].  Given  G  =  (V,E),  we  say  that  (a,b)r(a,c)  and 
(fc,a)r(c,a)  whenever  {a,  6}  G  E,  {a,c}  G  E  and  {6,c}  £  E. 

The  reflexive,  transitive  closure  of  r,  r*,  is  an  equivalence  relation  on  the 
possible  orientations  of  edges  in  E.  For  obvious  reasons,  we  call  these  equivalence 
classes  implication  classes.  If  A  is  a  set  of  axes  (e.g.  an  implication  class)  then  A 
denotes  the  set  of  undirected  edges  {{a,  6}  :  (a,  6)  €  A  V  (6, a)  G  A},  and  A-1  is 
the  set  of  arcs  {(b,  a)  :  (a,  b)  G  A}.  A  set  of  arcs  A  is  consistent  if  A  0  A-1  =  0, 
and  is  inconsistent  when  A  H  A-1  ^  0. 
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(6,  o)r(fe,  c)  (d,  e)r (/,  e)r . . .  r(e,  d) r . . .  r(e,  /) 

Figure  4:  Graphs  and  Implication  Classes 

Implication  classes  have  been  studied  by  M.C.  Golumbic  and  many  of  the 
lemmas  in  this  section  have  originally  been  shown  in  [9]  or  [10]. 

Lemma  4.1:  If  A  ±  B  are  implication  classes  of  G  then  either  A  =  B~l  or 
AHB  =  0. 

Proof:  Assume  that  {a,  b}  €  A  fl  B.  Without  loss  of  generality,  let  (a,  b )  6  A.  If 
(a,  6)  6  B  then  B  =  A  since  implication  classes  are  equivalence  classes.  Therefore 
(6,  a)  €  B,  and  (6,  a)  ^  A.  By  definition,  if  (a,  6)r(a',  b')  then  (6,  a)r(6',  a').  Thus 
some  (c,d)r*(a,6)  if  and  only  if  (d,  c)r*(6,a),  so  A  =  B-1.  □ 

Given  an  undirected  graph  G\  =  (V,  E)  pick  any  implication  class  B\,  delete 
Bi  from  Gi,  forming  G2  —  (V,  E—  B\).  Next  form  G3  by  removing  the  underlying 
set  B2  of  some  implication  class  B2  of  G2.  Continue  the  process  until  removing 
Bk  from  Gk  results  in  a  graph  with  no  edges.  The  sequence  of  implication  classes 
removed,  Bi,B2,  . . . ,  Bk,  is  called  a  T -decomposition  of  G.  The  following  theorem 
points  out  the  usefulness  of  T-decompositions. 

Theorem  5:  (TRO  Theorem  [9])  Let  B\,B2, . . .  ,Bk  be  a  F-decomposition  of  an 
undirected  graph  G.  The  following  statements  are  equivalent: 

i.  G  is  a  comparability  graph. 

ii.  Every  implication  class  of  G  is  consistent. 

iii.  Each  Bi  in  the  T -decomposition  is  consistent. 

Furthermore,  when  these  conditions  hold,  Bi  U  B2  U  . . .  U  Bk  is  a  transitive 
orientation  of  G. 
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Proof:  The  proof  of  this  theorem  requires  several  technical  lemmas,  and  thus  is 
beyond  the  scope  of  this  paper.  The  interested  reader  is  referred  to  [9,10].  0 

Let  A  be  any  implication  class  of  the  graph  G.  Then  we  call  the  underlying 
set  of  edges  A  its  color  class.  The  TRO  theorem  suggests  a  sequential  algorithm 
for  finding  transitive  orientations  of  comparability  graphs.  One  can  take  any  edge, 
orient  it  arbitrarily,  find  the  associated  implication  class,  add  the  implication  class 
to  the  transitive  orientation  and  remove  its  color  class  from  the  comparability 
graph.  Repeating  this  procedure  yields  a  r -decomposition  of  the  comparability 
graph  and  therefore  a  transitive  orientation.  This  is  essentially  the  algorithm  in 
[21]. 

If  we  are  dealing  with  a  comparability  graph  it  is  sufficient  to  consider  color 
classes  instead  of  implication  classes,  since  every  color  class  A  represents  and 
implication  class  A  and  its  inverse  A-1.  When  talking  about  color  classes  we 
always  assume  that  the  corresponding  implication  classes  are  consistent. 

In  order  to  parallelize  the  sequential  algorithm  above  it  is  neccessary  to 
understand  how  color  classes  change  during  a  T-decomposition.  We  will  see  below 
that  the  changes  are  very  simple:  color  classes  are  either  merged  with  other  color 
classes  or  remain  unchanged. 

Lemma  4.2:  Let  B  be  an  color  class  of  G  =  (V,  E).  Every  implication  class  of 
G'  =  (V,  E  —  B)  is  the  union  of  color  classes  of  G. 

Proof:  The  r  relation  for  G',  restricted  to  E  —  B,  contains  the  corresponding 
restriction  of  the  r  relation  for  G.  0 

The  three  edges  of  a  triangle  in  the  undirected  graph  G  form  a  iricolorcd 
triangle  if  they  belong  to  three  distinct  color  classes.  We  say  that  two  color  classes 
A  and  B  are  triangle  related ,  written  A&B,  if  there  is  a  tricolored  triangle  in  G 
with  one  edge  in  A  and  another  edge  in  B. 

Lemma  4.3:  Let  A  and  B  be  two  distinct  color  classes  in  G  =  (V,  E).  A  is  not 
an  implication  class  of  G'  =  (V,  E  —  B)  iff  A&B. 

Proof:  The  proof  is  a  simple  consequence  of  the  definition  of  the  r  relation.  It 
will  be  omitted  here.  0 

An  immediate  implication  of  Lemma  4.3  is 

Lemma  4.4:  Let  the  color  classes  BXt  ...  ,Bk  of  G  =  (V,  E)  be  an  independent 
set  under  the  a  relation.  Then  in  G'  —  (V,  E  —  Bi),  the  collection  {B2, . . . ,  B*}  is 
an  independent  set  under  a. 

Corollary  4.4.1:  If  color  classes  B|, . . .  ,B*  ofG  form  an  independent  set  under 
the  a  relation ,  then  they  are  the  first  k  color  classes  for  a  F -decomposition  of  G. 


Proof:  Follows  from  the  definition  of  independent  set.  0 

Lemma  4.5:  Let  B\, . . .  ,£*  be  a  maximal  independent  set  under  the  a  relation 
for  some  graph  G\  =  (V,  E).  Every  color  class  of  Gk+i  =  ( V,  E- Bx  —  B2  — . . .  -  £*) 
is  the  union  of  at  least  two  color  classes  of  Gx. 

Corollary  4.5.1:  The  number  of  color  classes  for  Gk+ 1  is  at  most  half  the  number 
of  color  classes  for  G. 

Proof:  Since  the  B,  form  a  maximal  independent  set  under  a  every  color  class 
of  G  which  is  not  one  of  the  £,-  must  be  adjacent  to  one  of  the  By.  Because  of 
Lemma  4.3  it  will  be  merged  with  some  other  color  class.  0 

The  input  to  our  algorithm  is  an  undirected  graph  G  =  (V,  E).  The  output  is 
either  G,  a  transitive  orientation  of  G,  or  an  indication  that  G  has  no  transitive 
orientation.  With  G\  initialized  to  be  G,  and  G  initially  equal  to  (V,  0),  if  no 
inconsistent  implication  class  is  found  in  the  first  iteration,  the  algorithm  proceeds 
in  iterations  as  long  as  the  set  of  color  classes  is  non-empty. 

Each  iteration  consists  of  the  following  four  steps: 

1.  Determine  the  color  classes  of  G{.  This  can  be  done  using  standard  parallel 
techniques  such  as  solving  2-SAT  formulae  or  finding  connected  components 
[22]. 

2.  Determine  the  a  relation  on  color  classes. 

3.  Use  a  maximal  independent  set  subroutine  [15,17]  to  obtain  a  maximal 
independent  set  M  of  color  classes. 

4.  In  parallel,  for  each  Bj  in  M,  delete  Bj  from  G<,  and  add  Bj  or  Bj  1  to  G. 

Step  3  is  the  most  expensive  of  these  steps,  requiring  0(log2n)  time  and  n4 
processors.  The  logn  iterations  can  therefore  be  done  in  0(log3n)  time  on  n4 
processors. 

5  Maximum  Matching 

The  two  processor  scheduling  and  transitive  orientation  algorithms  can  be  used 
to  find  maximum  matchings  on  co-comparability  graphs.  To  find  a  maximum 
matching  on  the  co-comparability  graph  G  =  (V,  E),  first  create  the  comparability 
graph  Gy  the  complement  of  G.  Applying  the  transitive  orientation  routine 
converts  G  into  a  precedence  graph.  An  optimal  two  processor  schedule  can  be 
found  for  the  precedence  graph  using  our  scheduling  algorithm.  We  will  see  below 
that  the  pairs  of  tasks  scheduled  together  form  a  maximum  matching  of  G. 

Let  S  be  any  optimal  two  processor  schedule  for  3.  A  task-pair  of  5  is  a 
pair  of  tasks  mapped  to  the  same  timestep  by  5.  Since  there  are  no  precedence 
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relationships  between  tasks  in  a  task-pair,  the  set  of  task-pairs  of  S  form  a 
matching  in  G.  Because  5  is  an  optimal  schedule,  no  schedule  has  more  task- 
pairs. 

A  task  is  available  at  some  point  in  a  schedule  if  it  can  be  executed  without 
violating  the  precedence  constraints. 

Lemma  5.1:  If  a  co-comparability  graph  G  has  a  perfect  matching  then  3  has  a 
schedule  where  every  task  is  in  a  task-pair. 

Proof:  We  say  a  pair  of  tasks  is  mated  if  the  pair  is  in  the  perfect  matching. 
Construct  the  schedule  (and  modify  the  ’’mated”  relationship)  iteratively  as 
follows: 

If  two  mated  tasks  are  both  available,  schedule  one  such  mated  pair. 
Otherwise  find  two  mated  pairs,  (f,f')  and  (s,s'),  such  that  t  and  s 
are  available  and  there  is  no  precedence  relationship  between  t'  and  s/. 
Schedule  t  with  s  and  mate  t'  with  s'. 

Note  that  there  are  never  precedence  constraints  between  a  pair  of  mated  tasks. 
This  method  clearly  takes  two  tasks  each  timestep  and  does  not  violate  the 
precedence  constraints.  What  we  must  show  is  that  it  always  constructs  a  schedule 

for  3. 

Assume  to  the  contrary  that  at  some  point  it  does  not  find  a  pair  of  tasks  to 
schedule.  Let  U  be  the  set  of  available  tasks  and  U'  be  the  set  of  tasks  which  are 
mated  to  tasks  in  U.  Since  the  method  fails,  U  n  U'  =  0  and  there  is  a  precedence 
relationship  between  every  pair  of  tasks  in  V  (i.e.  U'  is  totally  ordered).  Let  t' 
be  the  task  in  U'  which  precedes  all  other  tasks  in  U' .  Since  t'  &  U ,  there  must 
be  some  t  €  U  such  that  t  -<  t'.  However,  by  the  transitivity  of  precedence,  t  also 
precedes  its  mate  -  contradiction.  0 

Lemma  5.2:  Let  G  —  (V,  -<)  be  a  precedence  graph  and  S  a  two  processor 
schedule  for  3  =  (V  —  {<},  -<).  A  single  timestep  containing  t  can  be  inserted 
into  S  yielding  a  schedule  for  3. 

Proof:  Let  t'  be  the  last  predecessor  of  t  in  S.  Insert  task  t  immediately  after 
the  timestep  containing  t'.  Obviously  there  are  no  precedence  conflicts  between  t 
and  its  predecessors.  Since  5  is  valid  schedule,  there  are  no  precedence  conflicts 
between  tasks  in  V  —  {t}.  Therefore  any  precedence  conflict  which  is  violated  is 
of  the  form  t  -<t.  By  transitivity  t'  also  precedes  t,  so  t  comes  strictly  after  t'  in 
S.  Since  t  is  inserted  immediately  after  t',  task  t  appears  before  i  in  the  modified 
schedule.  □ 
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Let  M  be  the  tasks  in  a  maximum  matching  on  G.  The  above  Lemmas  suggest  a 
way  to  obtain  a  schedule,  S,  for  3  =  (V,  X)  where  the  paired  tasks  of  S  are  precisely 
the  tasks  in  M.  Start  by  finding  an  optimal  schedule,  S'  for  the  subgraph  of  G 
induced  by  M  and  add  the  tasks  mV  —  M  one  at  a  time.  One  JVC  implementation 
of  this  algorithm  involves  bucket  sorting  the  tasks  mV  —  M  based  on  which  task- 
pair  of  S'  they  follow.  By  topologically  sorting  the  tasks  within  each  bucket  we 
can  quickly  determine  where  each  task  should  be  inserted. 

Theorem  6:  The  task-pairs  of  any  optimal  schedule  for  G  form  a  maximum 
matching  on  G. 

Proof:  Let  M  be  the  tasks  in  some  maximum  matching  of  G.  Let  5  be  an  optimal 
schedule  for  the  subgraph  of  G  induced  by  M .  By  Lemma  5.1,  the  task-pairs  of 
S  form  a  maximum  matching  on  G.  By  Lemma  5.2  we  can  insert  the  other  tasks 
of  G  one  at  a  time  without  disturbing  the  task-pairs.  Therefore,  the  task-pairs  of 
the  resulting  schedule  for  G  form  a  maximum  matching  on  G.  Since  every  optimal 
schedule  has  the  same  number  of  task-pairs  and  the  task-pairs  of  every  schedule 
form  a  matching,  the  task-pairs  of  any  optimal  schedule  for  G  forms  a  maximum 
matching  on  G.  G 

If  G  is  not  transitively  orientable  it  may  still  be  possible  to  find  a  maximum 
matching  in  G  =  (V,E).  Assume  we  are  given  a  set  U,  consisting  of  O(logn) 
edges,  such  that  G  U  U  is  transitively  orientable.  The  following  method  finds  a 
maximum  matching  in  G. 

For  each  S'  C  U  such  that  S'  is  a  matching  find  (in  parallel)  a  maximum 
matching  in  G'  =  (V  —  {v  :  (v,v')  €  S'},  E  —  U).  A  maximum  matching  for 
G  occurs  whenever  the  cardinality  of  the  maximum  matching  for  G'  plus  |5'|  is 
maximal. 

A  graph  G  is  a  fc-nearly  comparability  graph  when: 

—  G  has  at  most  fclogn  inconsistent  implication  classes  and 

—  each  inconsistent  implication  class  of  G  is  split  into  consistent  implication 

classes  by  the  addition  of  at  most  k  edges. 

A  fc-nearly  co-comparability  graph  is  the  complement  of  a  fc-nearly  comparability 
graph. 

Let  G  be  a  fc-nearly  co-comparability  graph  (for  some  constant  fc).  The 
following  is  an  outline  of  an  JVC  algorithm  for  finding  a  maximum  matching 
in  G.  In  parallel  examine  each  set,  T,  of  fc  edges  not  in  G.  Determine  which 
inconsistent  implication  classes  are  split  when  T  is  added  to  G.  For  each 
inconsistent  implication  class  A,  pick  any  set  of  fc  edges  which  splits  A  into 
consistent  implication  classes.  At  most  k2logn  edges  are  picked,  so  the  above 
method  can  be  used  to  find  a  maximum  matching  for  G. 


6  Conclusions 


Although  the  algebraic  approach  was  used  to  obtain  the  first  parallel  matching 
algorithms  [14,18],  these  are  randomized  algorithms.  It  is  interesting  that  we 
can  obtain  deterministic  matching  algorithms  for  wide  classes  of  graphs  using  a 
purely  combinatorial  approach.  Perhaps  the  combinatorial  approach  will  yield 
deterministic  algorithms  for  matching  on  other  classes  of  graphs  as  well. 

It  was  surprising  how  much  more  difficult  computing  the  actual  schedule  was 
than  simply  computing  its  length.  In  higher  complexity  classes  such  as  V  and  MV 
it  is  often  easy  to  go  from  the  decision  problem  to  computing  an  actual  solution, 
because  of  self  reducibility.  However  this  does  not  necessarily  seem  to  be  the 
case  for  parallel  complexity  classes.  To  support  this  observation  we  note  that 
the  random  MC  algorithm  for  finding  the  cardinality  of  a  maximum  matching  is 
much  simpler  than  the  random  MC  algorithm  for  determining  an  actual  maximum 
matching  [13]. 

There  are  several  open  problems  related  to  scheduling.  We  are  attempting  to 
extend  our  two  processor  result  to  the  case  when  the  tasks  have  nonuniform  start 
times  and/or  deadlines.  When  the  precedence  constraints  are  restricted  to  in-trees 
or  out-trees  there  are  parallel  algorithms  for  generating  schedules  on  an  arbitrary 
number  of  processor  [3,11].  It  is  an  open  problem  whether  interval-ordered  tasks 
[19]  can  be  scheduled  in  parallel. 

One  variant  of  the  two  processor  problem  that  we  know  to  be  .ATP-complete 
(by  reduction  from  the  clique  problem)  allows  incompatibility  edges  as  well  well  as 
precedence  constraints.  When  there  is  an  incompatibility  constraint  between  two 
tasks  they  can  be  executed  in  either  order,  but  not  concurrently.  Incompatibility 
constraints  arise  naturally  when  two  or  more  tasks  need  the  same  resource,  such 
as  special  purpose  hardware  or  a  database  file. 
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